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Abstract —Traditionally power distribution networks are either 
not observable or only partially observable. This complicates 
development and implementation of new smart grid technologies, 
such as those related to demand response, outage detection and 
management, and improved load-monitoring. In this two part 
paper, inspired by proliferation of metering technology, we dis¬ 
cuss estimation problems in structurally loopy but operationally 
radial distribution grids from measurements, e.g. voltage data, 
which are either already available or can be made available with 
a relatively minor investment. In Part I, the objective is to learn 
the operational layout of the grid. Part II of this paper presents 
algorithms that estimate load statistics or line parameters in 
addition to learning the grid structure. Further, Part II discusses 
the problem of structure estimation for systems with incomplete 
measurement sets. Our newly suggested algorithms apply to 
a wide range of realistic scenarios. The algorithms are also 
computationally efficient - polynomial in time - which is proven 
theoretically and illustrated computationally on a number of test 
cases. The technique developed can be applied to detect line 
failures in real time as well as to understand the scope of possible 
adversarial attacks on the grid. 

Index Terms —Power Distribution Networks, Power Flows, 
Struture/graph Learning, Voltage measurements, Transmission 
Lines. 


I. Introduction 

The power grid is composed of a network of transmission 
and distribution lines that enable the transfer of electrical 
power from generators to loads. The design, operation and 
control of these networks is typically hierarchical with a major 
division occurring between the transmission network of high 
voltage lines connecting sub-stations and power plants, and 
the distribution network of medium and low voltage lines that 
connect the transmission sub-stations to the end-users. Here, 
we focus on distribution networks. 

The design of distribution networks may appear to be loopy 
or meshed, however for practical engineering concerns, the 
vast majority of distribution grids are operated as ’’radial” 
networks, i.e. as a set of non-overlapping trees. Switches 
in the network are used to achieve one radial configuration 
out of many possibilities. Each tree in the network has a 
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substation at the root and customers positioned at the other 
nodes. Switching from one tree-like operational configuration 
to another is typically caused by system upsets, e.g. faults and 
outages, and may occur few times a day or even an hour. 

The radial configuration distinguishes distribution networks 
from transmission networks that generally have multiple loops 
energized all the time to guarantee continuous delivery of 
power to every node, even in case of occasional line faults 
and outages. Radial configurations and one-way flow of power 
have led to much less monitoring, observability, and state 
estimation in distribution as compared to meshed transmis¬ 
sion networks 0 - The recent proliferation of smart grid 
technology, including smart meters that measure electricity 
consumption at the node level, is creating a new opportunities 
to extract information important to grid operators and planners. 
Such efforts are also getting additional attention in view of 
mounting concerns over data security and protection of user 
privacy 0. 

In this paper (Part I), we seek to develop low-complexity al¬ 
gorithms to learn the current operational structure in ‘radial’ 
distribution networks using only nodal measurements. Nodal 
measurements may include voltage magnitudes, voltage phase 
(potentially), and power injections and are typically available 
at smart meters, pole-mount or pad-mount transformers, and 
distribution phasor measurement units (PMU). Accurate struc¬ 
tural estimation impacts many important applications including 
failure identification 0> outage management, and recovery 
following major and minor disruptions (e.g. hurricanes to 
individual lightning strikes), grid reconfiguration pj for power 
flow optimization and generation scheduling 0,0-0, and 
quantifying the need for additional meter placement. From 
an adversarial viewpoint, our work can be viewed as low- 
intrusion learning by a rogue agent interested in estimating the 
grid structure for a data attack 0. 0. In the subsequent part 
(Part II), we will look at developing algorithms that are able to 
estimate the statistics of power consumption at the grid nodes 
or estimate the parameters of operational lines in addition to 
determining the grid’s radial structure. Further, we will analyze 
learning the operational grid structure with missing data, where 
observations from a subset of nodes are not available. 

A. Related Work 

Our work falls in the broad category of ‘graph learning’ 
problems that have been approached from different directions. 
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For general graphs and graphical models (9|, maximum- 
likelihood structure estimation has been researched in several 
papers by utilizing prior information such as sparsity of the 
parameter space G3- 03’ size of the graph neighborhood 
03’ etc. Techniques employed include both traditional convex 
optimization 03, 03 as well as greedy learning 03- For 
power grids, structure estimation techniques discussed in the 
literature can be classified based on the type of measurements 
available as well as assumptions made regarding grid structure 
and user behavior. In 01 - a maximum likelihood estimator 
(MLE) with regularizes for low-rank and sparsity is used 
to recover the grid structure using locational marginal prices 
(LMPs). In 03, a model using bus phase angles as a Markov 
random field for the DC power flow builds a dependency graph 
based approach to detect faults in grids. In work specific to 
radial distribution grids, 03 provides a structure identification 
algorithm that uses signs within the inverse covariance matrix 
(or concentration matrix) of voltage measurements to generate 
a minimum spanning tree. In |3j, topology identification with 
limited measurements in a distribution grid with Gaussian 
loads is used to design a machine learning (ML) estimate 
with approximate schemes. Our work uses ordering of second 
moments, not a ML approach, to reconstruct a radial grid 
sequentially from the leaves to the root, making it distinct 
from previous work. Our algorithm design is based on a linear 
coupled approximation for lossless AC power flow that is 
idealized but practical G3- G3 for analyzing distribution 
grids where the line and voltage characteristics limit the 
accuracy of traditional approximations. Unlike related work, 
our topology learning algorithm is agnostic to the load profile 
distributions or variability in line impedances and requires only 
a less restrictive assumption on the correlation of load profiles. 

The rest of the paper is organized as follows. We start the 
next section with a description of the distribution grid and 
summary of the learning problems discussed in the manuscript. 
Section III and Appendix |A| describe the linear coupled (LC) 
power flow model and its special case, the DC-resistive power 
flow model. Statistical trends in observed nodal measurements 
are discussed in Section |TH-C| Next, we use the derived results 
to design algorithms for learning the distribution grid structure 
using the power flow models in Section [V] Simulation results 
elucidating the performance of our algorithms on test distri¬ 


bution grids are presented in Section VI Linally, Section VII 


concludes and suggests future directions that will be explored 
in a subsequent work. 


II. Technical Preliminaries 

The structure of a radial distribution network has important 
features that motivates our algorithm development. We discuss 
the radial structure in detail here and introduce the notation 
used in this paper. We then formulate the learning problem 
tackled in this paper in terms of its input data and deliverables 
and discuss the underlying motivation. 


A. Structure of Radial Distribution Network 

We consider a meshed distribution network which is op¬ 
erated as a union of non-intersecting ‘radial’ trees, i.e. a 



Fig. 1. Schematic layout of an example of a distribution grid fed from 4 
substations, with substations represented by large red nodes. The operational 
grid has a base-4 spanning forest configuration that is formed by solid lines 
(black). Dotted grey lines represent open switches. Each tree in the forest has 
one substation at the root. The load nodes within each tree are marked with 
the same color. 


spanning forest, by configuring switches as shown in Lig. |T] 
There are exponentially many (in the number of switches) 
possible configurations of spanning forests. The grid-graph 
with all the switches closed is denoted 9 = (V, £), where V 
is the set of nodes of the graph and £ is the set of undirected 
edges of the graph. We denote nodes with single Roman letter 
subscripts a and undirected edges with pairs of Roman letter 
subscripts ( ab ). The operational grid is a forest denoted by 'J 
which spans all the nodes in V. Specifically, T is a special 
subgraph of 9 (T C 9) such that 

• T is a union of K non-overlapping trees covering all the 
nodes of the graph 

• Each tree contains exactly one of the K bases (substa¬ 
tions), J = U t= i 

The distribution system J is a ‘base-constrained spanning 
forest’ with operational edges £ where fi 7 C £. Table [I] 
provides other relevant notations (nomenclature) used through 
out this manuscript to denote various nodal and edge features 
of the grid 9 and the operational forest T. 


B. Problem Formulation and Contribution 

We consider large distribution grids where the utility (ob¬ 
server) is unsure of the grid configuration because of insuffi¬ 
cient or inaccurate switching data, perhaps caused by a recent 
system upset. Alternatively, we could take the point of view of 
a third party observer, who may be an aggregator or adversary, 
trying to extract the current forest configuration from available 
nodal measurements. We assume the current spanning forest 
configuration is kept intact sufficiently long for load profiles 
at grid nodes to attain a steady distribution (longer than the 
fluctuations but shorter than changes in the mean load). 

We assume that the observer has access to nodal measure¬ 
ments, but not edge measurements - an assumption consistent 
with the recent expansion of smart grid monitoring devices. 
Smart meters generally provide nodal voltages and power 
injections at fine spatial resolution, i.e. at the individual 
customer level, but they do not provide any edge flow data. 
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TABLE I 
Notation Table 


TeS 

V 

N 

K 

£ 

£ y 

M k 

£ T * 

path between a,b £L V Tt 
P 7 k 

c a 

b is a descendant of a 
b is the parent of a 

0a,V a 

e.v 

e a 

e 

Pa,q a 

p-.q 

fiafr ■ Pah - ^ab ■ %ab 

$,g,r,x 

Zy 

Cly 

Py 

Hy 

/)T: 

M 


a particular forest configuration 
of the physical distribution network 9 
vertex set of 9 

# of nodes other than sub-stations in V 
number of sub-stations in the network 
edge set of 9 

set of edges operational within 3 
tree within 3 containing 
the sub-station 

reduced incidence matrix of the tree 7^ 
set of edges in 7 £ 
set of nodes in 7^ 
subset of edges from 8? k s.t. each 
node with edge in the subset, except 
a and b, contributes exactly two edges 
path from a to slack bus in 7^ 
a contributes 8^ k 
(i ab ) E £a k and b contributes 8j k 
voltage phase and magnitude resp. at bus a 
I Vector of non-sub station voltage 
| phases and magnitudes resp. 

= 1 — v a , voltage deviation at node a 
— 1 — v, voltage deviation 
I resp. active and reactive power 
injection/consumption (+/—) at bus a 
Vector of non-sub station 
power injections/consumptions 
susceptance, conductance, resistance, 
reactance resp. of edge ( ab ) 
diagonal matrix of line susceptances, 

| conductances, resistances, reactances resp. 
matrix of second moments for variable y 
matrix of covariances for variable y 
vector of means for variable y 
reduced weighted Laplacian matrix 
with edge weights in y 
Descendants (including itself) of a in 7k 
set of unobserved nodes 


Additional instrumentation is emerging for pole-mount or pad- 
mount transformers 03’ however, these new devices still 
only provide nodal voltages and aggregated customer power 
injections^ Some edge flow data is available to utilities, 
however, this is generally at a few select locations in the 
distribution grid, e.g. at the substation/root node, voltage 
regulators, reclosers, or other major utility equipment. These 
select locations may also have nodal and edge data from 
another emerging technology, i.e. distribution grid PMUs [(20) . 
However, we continue to restrict our input data to nodal values, 
which is consistent with the new, ubiquitous sensing provided 
by smart meters. 

The nodal devices provide the observer with temporal 
samples of the nodal voltage magnitudes. The observer seeks 
to use these samples to learn the current configuration of 
switches that determine the ‘base-constrained spanning forest’. 
To supplement the voltage magnitude samples, the observer 
has historical information about statistics of the nodal con¬ 
sumption. 

*We use the term ‘p° wer injection', 'power consumption' and 'load' 
interchangeably to denote the power profile at each interior (non-substation) 
node of the distribution system. 


III. Power Flow Models and Statistical 
Correlations 

Our approach to the structure learning problem relies on 
linearized PF models on radial spanning forests that enable 
efficient reconstruction of the grid structure via a second- 
moment analysis. The most general of the two, termed the 
Linear Coupling (LC) model, ignores losses of active and 
reactive powers and consistently assumes small voltage mag¬ 
nitude and phase drops between connected nodes. For tree¬ 
like distribution grids, the LC-PF model becomes equivalent 
to the LinDistFlow PF model in (H), GD- The second 
model considered in the paper, coined the DC-resistive model, 
corresponds to the special resistance dominating case of the 
LC-PF model. These PF models are described in more detail 
in Appendix [A] 


A. Linear Coupled Power Flow (LC-PF) model 

As noted in Eqs. (25)26) in Appendix |A] the LC-PF model 
is derived from the general AC power flow model by assuming 
small voltage magnitude deviations and phase differences 
between neighboring buses in the grid. It is convenient to 
restate the linear equations in LC-PF model in matrix form 
as: 


p = Hg£+Hp6, q = Hp£-H g Q ( 1 ) 

where p, q. e and 0 are defined in Table |I] //„ and //p are the 
weighted graph Laplacian matrices associated with forest cF 
such that 

('Ec:(a,c)eB*Sac if b = 0 

H g {a,b) = < - g ab if (ab) £ (2) 

(0 otherwise 

//p has a similar structure with ^-weights replaced by 13- 
weights. The weighted graph Laplacians can be stated in terms 
of the directed incidence matrix M as 

H g = M r g s M, H$=M r $ J M (3) 

Here, g J and [3 r are diagonal matrices representing, respec¬ 
tively, line conductances and susceptances for edges within 
32 M is the edge to node directed incidence matrix of 32 
See Fig. [2] for an example. Every row m a i, in M is equal to 
±(e^ — e7) and represents the directed edge (ab), where the 
direction of an edge is chosen arbitrarily. e a £ R ,v 1 k is the 
standard basis vector associated with the vertex a, with 1 at 
the a ,h position and zero everywhere else. We can combine 
Eqs. 0 and express the complex power flows as: 

p + iq=M T (g 3r + i^)M(£-iQ) (4) 

Both //p and //„ are weighted graph Laplacians and are 
degenerate — showing K zero-eigenvalues associated with the 
freedom in fixing phase and voltage deviation (from nominal) 
at any node within each tree of the forest. It is natural to fix 
phases and voltages at the sub-stations making these ‘slack 
buses’ c/i for trees T* of the (operational) forest, 3 r such that 
0 ak = £a k = 0, for any k, 1 < k < K. Formally, elimination 
of the set of K sub-stations corresponds to elimination of K 
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components from all the vectors contributing Eqs. <[TJ, and re¬ 
duction of K rows and K columns from the weighted Laplacian 
matrices. All the eigenvalues of the resulting reduced graph 
Laplacian matrices are thus strictly positive. 

Without loss of generality, we will use the same notation 
for the original and reduced dimension variables 0,e,p and q 
and also refer to Eqs. ( |1|4| > as applied to the reduced vectors of 
dimension N x 1. We will also keep notations, //p and //„ for 
the reduced graph Laplacian matrices, and M for the reduced 
incidence matrix respectively. The reduced M has a block 
diagonal structure: M = diag(Mi ,M 2 , ■ • ■ ,Mg), where, is 
the invertible reduced incidence matrix of tree in 'f. Thus, 
M and correspondingly //p and H q are full rank, invertible and 
block-diagonal matrices. Inverting the linear non-degenerate 
Eqs. G we arrive at 

0 = lT p-M ~ x1 T q = Hfj x p -Hfj r q (5) 

E= p+M~ l x 3 'M- lT q=H~, l r p + H~ / l x q (6) 



nodes: abed edges 

1 -1 0 ol 1 
1 0 -1 0 2 
0 10-1 3 


(a) 


(b) 
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b 

c 
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edges. 
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2 

3 

’ 1 

-1 

0 
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' 1 

0 

1 

1 

0 

-1 

2 

M k ‘ = 

0 

0 

1 

0 

1 

0 

3 


1 

-1 

1 


nodes 

a 

b 

c 


(c) (d) 

Fig. 2. Illustration of the reduced incidence matrix construction on the 
example of a tree with 4 nodes, (a) Tree graph with four nodes (a.b,c,d) and 
three edges (1,2,3). (b) Complete directed incidence matrix. The “directed” 
freedom in choosing edge orientations is fixed as follows, 1 = (ab), 2 = ( ac ), 
3 = (bd). (c) The column corresponding to the slack bus (bus d) is removed in 
the reduced incidence matrix M k . (d) Inverse of the reduced incidence matrix 
M k . 


where H x / r = M 


T JJ- 


Mand H l/x = M r x :f 


M. r J and x 3 


are 


diagonal matrices representing, respectively, line resistances 
and reactances within the forest T. Their relation to g :y and 
(3 r are expressed in Eqs. (28 1 . 


B. Relations between second moments 


The real and reactive nodal power injections p and q in 
Eqs. (5]6! fluctuate because of exogenous processes, and their 
second moments are related by: 


e [ee r ] =h~]e WW X } X + HfjE[ qq T ]H; 

Hfj x E[pq r }Hfj r - H-'Elqp r ]H;; x 


-1 

T/r 


=► = H i/£p H i/ x +Hrfeaur 1 




- H7'XnnH7' ~ 


'1/j M“l/r 

Similarly, E £ + H^Y. q H;) x 


1 j x I Jc i 1 fr 


\/r PH 1 fx 


\/rP 1± \/r ' 

Eee =H l j 1 T. p H l j r - H l j x 

+ H \ ,%i H t lx~ H i /%p"\ j r 


(7) 


( 8 ) 


(9) 


These formulas are the basis for reconstruction/learning anal¬ 
ysis in the rest of the paper. 


IV. Trends in Second Moments over Tree 
Networks 

We now derive key results related to the second moments 
in voltage magnitudes that arise from the properties of the 
forest T. We denote the unique path from node a to the slack 
bus in tree Tj. by £j k . From [211, the inverse of the reduced 
incidence matrix of a tree has the following special structure: 


if edge r £ £ is directed 
along path from a to slack bus, 
M^ l (a,r) = < — 1 if edge r £ £j k is directed 

against path from a to slack bus, 
0 if edge r £ £a k 


( 11 ) 


Here, the direction of edge r = ( cd ) is specified by its 
representative row m cc t in the directed incidence matrix. For 
example, if m ct j = e^ e T d , the edge is directed from node c 
to node d, whereas for m C( j = e d - ej., the direction is from 
node d to node c. 

An immediate corollary of (11 1 is that M" 1 (a, r) = 0 if edge 
r and node a lie on separate trees within the forest T, a fact 
consistent with the block diagonal structure of M. Using CD 
in H g l =M M 1 , we derive for forest cF 


C. DC-resistive model 

The DC-resistive PF model (see Appendix [A]l is an extremal 
case of the LC-PF model realized when line reactance can be 
ignored in comparison with resistance (x/r —> 0). The relation 
between the statistics of active powers and voltage second- 
order moments deviations reduces to 

E [pp T ] = H s E^ T ]H g => E £ = H~%H~ X (10) 


//,, 1 (a.b) = 0 if a.b are on different trees 7 k and (12) 
H g 1 (a 1 b)= Y, M ~\ a x)g J * (r,r)M -1 (b,r) if a,b£7 k 

re£. 7 k 

Y —if a,b£7 k (13) 

Ti Ti &cd 

(cd)e£a k nv 


Thus, H~ l (a,b) is equal to the sum of the inverse conduc¬ 
tances of lines that are common to the paths from both nodes 
to the slack bus. If no such line exists, the corresponding entry 
in Hf 1 is 0. See Fig. 3(a) for illustration. Similar results hold 
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Fig. 3. Schematic layout of a distribution grid tree T^. The sub-station node 
represented by large red node is the slack bus. (a) Dotted lines represent the 
paths from nodes a and d to the slack bus. The common edges on those 
paths are edges (be) and (eO). Thus, H~ l (a,d) = l/gb e + 1/geO- (b) the set 
of descendants of a, denoted by D a k . Here, nodes a and c are descendants of 
both nodes a and b, while node d is not a descendant of a but only of node 
b. 


for other measurement matrices like 




Wntf' d 

0 otherwise. 


Let Da k denote the set of descendants of node a within the 
tree T^. We call 6 a descendent of a, if a lies on the (unique) 
path from b to the slack bus of 7/ ; , also including a itself in 
the set of its descendants. We call b the parent of a within 7k 
(there can only be one) if ( ab ) £ T/. and a is an immediate 


descendant of b as illustrated in Fig. 3(b) 
The following statement holds. 


Lemma 1. For two nodes, a and its parent b, in tree 7k 


H g 1 (a,c) — H g ! (6,c) 



if node c £ Dj k 
otherwise, 


05) 


Proof: For any node c which belongs to a tree not 
containing nodes a and b, Hf 1 (a,c)—Hf 1 (b,c) =0 according 
to (12). Now, focus on nodes contained, together with a and 
b, within 7^. Since b is c/’s parent, £j k = &7 k lj{ (a£>)} and we 
derive (also validating on the illustrative example in Fig. |3(b)[ ) 
that for any node c in tree T/., 


£j k p| £j k = £l k p| £j k if node c £ Dj k 

& n £ c T *= [^i k n e?] Ui(^)} if n ° de c e Djf 


resulting in Eq. ©■ ■ 

We now prove our main results regarding trends in second 
moments of deviations in voltage magnitudes (e) along any 
tree in the network. The results are conditioned on the follow¬ 
ing assumption regarding correlations in power injections at 
the non-substation buses. 

Assumption 1: For any two buses a and b drawing power 
from the same distribution sub-station, T, p (a 7 b ) > 0,E ? (a,6) > 
0,E p<? (a,6) > 0. 

Note that this assumption holds, in particular, if the overall 
node balance is such that each non-substation node a always 


consumes strictly more than it produces in active and reactive 
powers, i.e. p a < 0 ,q a < 0. This assumption is certainly true in 
any distribution grid with small and/or moderate penetration of 
renewables [|22j. However, the assumption is also reasonable 
for a system with significant penetration of generation which 
is still dominated in average by the consumption. 

Theorem 1. If node a f b is a descendant of node b within 
tree 7k, then for the DC-resistive model, E E (c/,a) > E £ (6,6). 


Proof: We first show that for any node a and its par¬ 
ent b, E e (a,a) > E E (6,6). Consider E £ (a,a) — Eg(a,6). From 
Eq. ( fT0| ), we derive 

Ee(a,a) -E e (a,6) (a,c)E p (c,rf) 

c,d 

(Hf\a,d)-Hf\b,d)) (16) 

(using Lemma [I]) =Y'F/r 1 (c/,c)E p (c,af)— \{d £ Dj k ) 

c,d Sab 


•E e (a,a) — E e (a,/?) >0 (using Assumption 1) 


(17) 


1 


Also,Eg (a, &)—E e (6,6) =£tf“ 1 (M)E P M)— >0 (18) 

Sab 


cED a 


Combining Eqs. < 17 1 and (18) we derive Eg (a, a) > 
Eg (a, 6) > Eg (6,6). Since node a is a descendant of node 6, 
there is a path a.c \, ...c r ,h, such that each node in the path 
is a parent of its predecessor. Then, we derive E £ (cz,cz) > 
E e (ci,ci) > ... > Eg(c r ,c r ) > Eg(6,6). ■ 

The following theorem is the LC-PF version of Theorem [I] 


Theorem 2. If node a f b is a descendant of node 6 on tree 
7^ in forest 7, then Eg(cz,cz) > E e (6,6) in the LC-PF model. 


Proof: Consider Eq. <§• Notice that the right hand 
side has four constituent terms (Hf}'L D Hf}, Hi} L a H7 }, 

v 1/r P 1/r’ \/x H 1/jc’ 

and We denote each of these terms 

by Eg where j £ {1,2,3,4}. For each individual term, applying 
Assumption 1 and the analysis in Theorem [I] we find that 
Eg(a,a) > Eg(6,6) if node a is a descendant of node 6, other 
than itself. Thus, the statement also holds for the sum. ■ 
We now focus on evaluating the term E[(e a — £&) 2 ], which 
is the expected value of the squared difference between two 
node voltage deviations (e). For any two nodes a and 6, the 
DC-resistive model yields: 

E[(e fl -e fc ) 2 ] =E e (a,a)—Eg(a,6)—Eg(6,a)+E e (6,6) 

(using ( fl6] l) =£//~ 1 (a,c)E p (c,<i) (H~ l (a,d) - Hf l (b,d)) 

c,d 

-Y J H g 1 {b,c)T. p (c 1 d) ( H g l (a 1 d)-Hf\b,d)) 

c,d 

= L (77g _1 (a,c) - H g l (6, c)) E p (c,d) 

c,d 

{Hf\a,d)-H s \b,d)) (19) 

Our next Lemma follows directly by applying Lemma [T] to 

Eq. <[g. 


Lemma 2. For two nodes, a and its parent 6 belonging to 
tree 7k, in the DC-resistive model, we derive E[(e fl — £/>) 2 ] = 
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For the LC-PF model, we evaluate the expression E[(e a — 
e b) 2 ] as well. In this case, for two nodes, a and its parent b, 
that lie on tree 7 k , we arrive at 
E[(e„-e fe ) 2 ] = E E (a,a) -T. e (a,b) - Z. e (b,a) +Z. e (b,b) 


Algorithm 1 Base Constrained Spanning Forest Learning: LC- 
PF Model_ 

Input: True 'L pi 'L q and L pq , m voltage deviation observations 
E J , 1 < j < m, all line resistances r and reactances x 


where E e is given by Eq. (|8j). Let E e and E 2 represent the 
symmetric terms WL^EpW^^and respectively in 

E e . Extending the result of Lemma |3] we derive 

A {a, a) - E{ (a,b) - E 2 (b, a) + E^ (b, b) = £ r 2 ab E p (c, d) 


C,d£Da k 


( 20 ) 


E 2 (a,a) -T.l(a,b) -E 2 (fc,a) + E 2 (b,b) = £*^,E 9 (c,d) 


1: Compute E £ (a,a) = LT =1 e a e a/ w for all nodes a. 

2: Undiscovered Set U <— {1,2, ...,A + /f}, Leaf Set L<—ty, 
Descendant Sets D a •<— {a}V nodes a. 

3: while ( U ^ <|)) do 
4: b* <^mzx. b€U 'L e (b,b) 

5: for all a G L do 

6: if 17=1 (e^ - 4*) 2 / m = UdeoAbMc.d) + 


Similarly, let E 2 = H l }'L pq H l } 
T 


wr/E^wr, 1 

1/r PH l/x 

Lemma [T| wit 


Eq. ©, we get 


c,d£D a k 

i 

Draw edge between nodes a and b 

(21) 

8 

D b * t— D b * (J D u 

and E e - E 3 e - 

9 

jL i — Lj — 


10 

end if 

terms in E e . Using 

11 

end for 


12 

L^LU{b*} 

= Pab^pq (c ■> d)x a b 

13 

end while 




c,d£D a 


= L*(a,a)-l£(a,b)-'Lt(b,a)+I%(b,b) (22) 

Combining Eqs. ( |20|2 1 122) we arrive at the following Lemma. 

Lemma 3. In the LC-PF model, E[(e a — £/,) 2 ] = 
L . deD -r k r 2 b 'L p (c,d) +x 1 ab 'L q {c,d) +2r ab x ab 'L pq (c,d), holds for 
a node a and its parent b belonging to the (operational) tree 
7 k . 


V. Learning Structure of Base-Constrained 
Spanning Forest 

Here, we propose Algorithm 1 to learn the structure of 
the distribution network using properties of voltage deviations 
for the LC-PF model. The polynomial time algorithm, based 
on the Theorems proved in the previous section, requires 
positivity of the correlation between nodal power injections 
(Assumption 1). The Algorithm is also agnostic to the prob¬ 
ability distribution of active and reactive power injections. 
To reconstruct the Base-Constrained Spanning Forest (£F = 
U J fc=i ! ....A'T'it), Algorithm 1 takes as input hw’ measurements 
of nodal voltage deviations. These measurements are used to 
create the empirical voltage deviation second moment matrix 
E e . The voltage deviations e/, at the substation nodes are 
assumed to be zero which implies the elements of the row 
E E (a, : )=() for each of the K substation nodes. The observer 
has prior information (or estimates using power injection 
measurements) for the true second moment matrix of power 
injections E p for non-substation nodes. 

Algorithm 1 Overview: We reconstruct each tree within the 
distribution grid forest sequentially moving from the leaves to 
the root nodes. At every stage, U represents the set of undis¬ 
covered nodes that are not part of the current reconstructed 
tree while L represents the set of ‘current leaves’ (nodes that 
are in the current reconstructed tree but with undiscovered 
parents). At each iteration. Step [4] selects the node b" from set 
U with the largest second moment of voltage deviation. Next 


Step [6] adds edges between node b* and nodes in set L of the 
growing tree using Lemma [3] In the ideal case when infinitely 
many voltage magnitude samples are collected, second-order 
moments of the power injections satisfy the relation in Lemma 
[3] However, we have a finite number of samples. Thus the 
presence of an edge is determined in Algorithm 1 by checking 
if the relative difference between the reals on the left and 
right sides of the condition in Step [6] is less than a predefined 
tolerance, x: 

I7 =1 (e'-e 2 »)> 

1 — --- x 

Lc.deDa r^p( c ,d) +x 1 ab 'L q (c 1 d)+2r ab x ab 'L pq (c 1 d) 

(23) 


Steps [9] and 12 update the set of current leaves L before 
repeating the reconstruction steps with a new undiscovered 
node. 


Algorithm Complexity: Ignoring complexity of computing 
the second moments in Steps [4] and [6] (part of the data 
pre-processing), there are N + K steps (N load nodes and 
K substation nodes) in the ‘while’ loop, and at most N 
comparisons in the ‘for’ loop for each node. Therefore, the 
worst-case complexity of this algorithm is 0(N 2 +NK). 

Note that we use LC-PF model in Algorithm 1. To design 
the DC-resistive version of Algorithm 1, the condition in Step 
[6] should be replaced with the result in Lemma [2] (Required 
modifications are straightforward and thus their description is 
omitted.) 


VI. Experiments 

We perform a set of numerical experiments to test and 
demonstrate the performance of Algorithm 1 in extracting 
the operational radial forest T from meshed “as-designed” 
distribution networks S- We remind the reader that the observer 
in Algorithm 1 has information of the full graph S, the 
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impedance (resistance and reactance) of all lines (operational 
or open) as well as the number of connected substation buses. 
Further, true second moments of active and reactive power 
injections at each non-substation node are assumed to be 
known. The set of measurements available as input with the 
observer comprises of deviations in voltage magnitudes (e) at 
the grid nodes. 

Table [II] summarizes the distribution grid test systems by 
the number of load busses, number of substation busses, and 
the number of tie switches. Additional information on these 
test systems (23 |-|25| can be found online at )26) . In normal 
operation, each test grid consists of nodes in a forest- or tree¬ 
like configuration 'J with open tie-switches. We construct the 
complete meshed network 9 by closing all the tie-switches. 
To test the scalability of our algorithms, we increase the 
number of possible forest configuration by introducing several 
additional non-operational lines into each system as noted in 
Table [H| Although these contribute to 9, they are kept open and 
do not contribute to power flows in the operation forest 3 r . The 
impedances (reactances and resistances) of the additional lines 
are generated by assigning random values uniformly between 
the minimum and maximum impedances of the operational 
lines. Fig. [4] displays the test networks 9 (solid and dashed 
lines) and the respective operational forests (F (solid lines 
only). 


TABLE II 

Summary of the tested distribution grids 


Test 

Case 

Number of buses / substa¬ 
tions / tie-switches 

Non-operational 
lines added 

Source 

bus\3 3 

13/3/3 

10 


23 


bus 29 1 

29/1/1 

20 


24 


6ih 83 1 1 

83/11/13 

30 


25 



For each numerical experiment on a grid from Table [II] we 
pick an operational spanning forest T by opening tie switches. 
We also choose the statistics of the injections at each node 
using Gaussian distributions, unless otherwise specified. These 
distributions are used to generate multiple power injection 
samples, and from each vector-valued sample, we solve a PF to 
compute voltages and phases at every node in the network with 
the voltages at the substations fixed (i.e. they are slack busses). 
Averaging over all PF solutions, we compute empirical cor¬ 
relations of voltage magnitudes and phases. Using only these 
correlations, we run our algorithms and compare the resulting 
reconstruction with the actual operational configuration. In 
the reconstruction, we assume the observer has access to the 
resistance and reactance of all the lines in 9- All powers and 
voltages are presented in per unit (p.u.) values. 

Figs. 5(a)(5(c) display the accuracy of Algorithm 1 for 
the different test grids from Table [II] The relative error is 
defined to be the number of mislabeled lines (connected 
when actually open and vice versa) divided by the size of 
the operational edge set. The relative error is averaged by 
computing many reconstructions using the same nodal power 


injection distributions. Different curves (colors) in Fig. 5(a) 
show the effect of changing the tolerance X in Eq. [23] 

The average fractional error in Figs. 5(a)(5(c)| decays ex¬ 





Fig. 4. Layouts of distribution grids, also showing operational cases tested, 
in accordance with description in the text and summary in Table |h] The red 
circles represent substations (marked as S). The blue circles represent load 
nodes that are numbered. Black lines represent operational edges, while dotted 
blue lines represent open tie-switches. The additional lines are represented by 

dotted green lines, (a) bus_ 13 3 test case (b) bus_ 29_1 test case (c) bus _83 11 

test case 


ponentially with the number of samples used by the observer 
in the learning algorithm. For the tolerance values considered 
here, the majority of structural errors arise due to connected 
nodes not satisfying condition (23 i and hence being labelled 
as open. The decay is then intuitive as an increase in the 
sample size makes empirical moments in voltage magnitudes 
approximate their true values better which in turn leads to an 
increase in the number of operational lines satisfying ( [23| and 
being correctly identified. On the other hand, if a sufficiently 
large value of x is used, condition ( |23| l will be relaxed and 
possibly be satisfied even by unconnected nodes. In such a 
case, a majority of errors will be recorded due to open lines 
being incorrectly labelled as operational. As errors of this type 
(open edges classified as operational) does not improve with 






































































the number of measurement samples, the average fractional 
errors will not decay with the sample size. This is elucidated in 
Fig. [6] where the bus_ 13_3 structure is learnt, in the presence 
of 50 non-operational lines. Note that for larger values of 
X in Fig. [6] the errors do not decay with the sample size 
whereas for smaller values, they do as justified in the preceding 
discussion. For larger sample sizes, a smaller value of x is 
indeed preferable as observed in Fig. [6] 

VII. Conclusions & Path Forward 

Accurate structural estimation of distribution grids is im¬ 
portant to many applications including failure identification, 
power flow optimization and estimation of state variables. In 
this manuscript, we have developed algorithms to learn the 
structure of a radial distribution grid using observed nodal volt¬ 
age magnitude measurements. We have used a Linear Coupled 
(LC) approximation that relates complex nodal voltages to the 
complex power consumptions to prove that second moments 
of nodal voltage magnitudes in radial distribution grids follow 
certain statistical structure/ordering. Our algorithm relies on 
these results to reconstruct the operational tree in a bottom up 
fashion - starting from the leaves and progressing to the root 
of the grid. 

The primary benefits of our approach are two-fold. First, our 
model is practical as voltage measurements are easily available 
at distribution grid nodes and individual devices. Second, the 
only assumption used regarding the statistics of the loads is 
the positivity of non-central correlation of nodal load profiles, 
which is natural for most distribution grids and more general 
than other assumptions discussed in the literature. We tested 
our algorithms on sample distribution grids and observed an 
exponential decay of average errors with increasing number 
of measurement samples. In the next part, we extend the 
work in this paper along the following directions: coupling 
structure learning with estimation of load statistics in the grid 
or estimation of line parameters in the grid, and learning grid 
structure even when measurement data is missing. Developing 
general algorithms for learning distribution grid operational 
forests that are not restricted to linearized power flow models 
remains a potential future direction of research. 


Appendix 


A. Basic Power Flows 


Note that the operational ‘base-constrained spanning forest’ 
T can be thought of as a spanning tree over an extended graph, 
where an (artificial) super node is introduced and connected 
with (artificial) lines to all the sub-station nodes. This trick 
allows, without a loss of generality, to limit our discussion in 
the following to spanning trees, thus replacing T by T. 

Let x = ( Zab = r ab + i x ab\{ a b) £ £) as the vector of complex 
line impedances in the grid, (i 2 = — 1.) Expressed in terms of 
the complex powers and potentials (voltages and phases) the 
Kirchoff laws over the operational (spanning tree) configura¬ 
tion T become 


Va £ V :P a = p a + iq a 


L ^a exp(f0/j) 

7* 

b:(ab)e£. 7 "“b 

( 24 ) 



(a) 



(b) 


0.8 

I 0.6 

1 

) 

) 

| 0.5 
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tolerance 10' 1 
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v --- 

""©."'O. 
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(O 

Fig. 5. Average fractional errors plotted against the number of measurement 
samples for the LC-PF model with different tolerance thresholds - corre¬ 
spondent to the Algorithm 2.(a), (b) and (c) show results for the bus_ 13_3, 

bus _29_1 and bus_ 83_11 models respectively. 


where the real valued scalars, v a and 0 a , characterize voltage 
magnitude and phase respectively at node a. We assume that 
the power within the system considered is balanced through 
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Fig. 6. Effect of threshold on the average fractional errors in Algorithm 1 
using the LC-PF model for the bus_\3_3 (with 50 additional non-operational 
edges). 


a slack bus, a = 0. The set of Eqs. (24), expressing potentials 


via complex powers injected at the nodes of the power graph, 
are called Power Flow (PF) equations. 


B. Linear Coupled (LC) Approximation of Power Flows 

We linearize the PF Eqs. © in the first order jointly 
over phase difference and voltage deviations (v a — 1 = E a ) 
from nominal, i.e. the two corrections are considered on equal 
footing. We arrive at the following set of equations: 


Pa — (P abi^a ^b) F gabifa £&)) ? 

b:(ab)GE 7 

Qa = ( gabif^a ~ $ab(fa ~ £&)) 

b:(ab )£& 7 

where Va £ V,V(ab) £ £ T : |e a | <C 1, |0 fl — 9&| <C 1, 

fab n • %ab 


gab — 


xlu + rl 


5 P ab — 


x lb + r lb 


(25) 

(26) 

(27) 

(28) 


'■ab ^' ab 

Eqs. ( 25|26| ) show coupling between phases and voltages, 
thus calling it the Linear-Coupled (LC) approximation, is 
proper. The linearity of LC-PL is used in the paper in deriving 
results to learn the grid structure. Two comments are in order. 
Lirst of all, notice that the LC-PL approximation does not 
make any assumption about the relative strength of inductance 
and resistance, thus making it applicable to power distribution 
systems where the two line characteristics are typically of the 
same order. Second, expressions under the sum on the rhs of 
Eqs. ( 25|26 1 represent active and reactive power flows which 
are antisymmetric (p a ^ b = -p b -+ a , q a -+b = ~qb^a)- This 
emphasizes an important consequence of linear approximation 
in the LC-PL model - both active and reactive losses in lines 
are ignored as such losses occur at second order. 


C. DC-resistive approximation 

In low-voltage distribution grids line inductances may be 
much smaller in magnitude than line resistances. Then the 
LC-PL model can be simplified even further. Indeed, taking 
this case to the extreme where inductance can be ignored 


in comparison with the resistance, \/{a,b} : $ ab <C gab , 
we arrive at the following resistance-dominating version of 


Eqs. ( |25|26] > 


\/a G V . p a ~ gabifa £b)-)t}a ~ gab(®b ®a) 

b:(ab)E£.^ b:{ab)(zZi 7 


We will coin this approximation DC-resistive PL. Its formu¬ 
lation is similar to the traditional DC flow model (27) where 
active power flows are related to phase angles. The traditional 
DC model, used primarily for transmission networks, requires 
line inductances to dominate resistances, which is seldom 
observed in distribution grids. 


D. From Power Flows to DistFlow and LinDistFlow 

The DistLlow Eqs. introduced by Baran and Wu in (17) , 
118] are derived from the PL Eq. © 

„ _ r P 2 a^ b +q 2 q^b , 

Pa^b ?ab 7 — Pb ' 


T P 2 —>b "T Qa—tb _ 

Ha—tb X a b 7 — b 


Y Pb-^d 

(bc)(z8, 7 \c^a 

Y . Qb->d 

(bc)GE^\c=fca 


(29) 

(30) 


v b — v a — 2 {fabPa^b +X ab q a ^> b ) + (f 2 b + x lb) Pa ^ b 

^a 

(31) 


where each of the three equations above are stated in terms of 
active, p a ->b , and reactive, q a - >b, powers over directed lines, 
a —> b, and voltages over nodes, a £ V. If power losses at any 
line segment is negligible, Eqs. (29|30|3l) reduce to |4). 


Pa—>b ~ Pb + 'Yi Pb-t-ci tja—tb ~ 

tjb H" Yj Qb^ci 

(32) 

(bc)GE 7 

(lbc)eE 7 


c^a 

c^a 


g>b ~ tya 2 (rabPa^b “I - Xab^a—^b) i 

^a = V 2 a 

(33) 


On a general graph with loops, the number of these (directed) 
edge-related variables in Eq. (24) is larger then the number of 
phases and voltages. However, the former becomes equal to 
the later if the grid is a tree, where the number of edges is 
equal to the number of nodes minus one. Therefore when the 
phase and voltage at one special node (each substation node 
per tree in our case) is fixed at vo (say), phases and voltages 
at all other nodes can be reconstructed from the directional 
line flows, and vice versa. 

Note that additional boundary conditions arise in a tree 
due to the requirement of active and reactive power flowing 
from/into any leaf being equal to its nodal injection. Thus, 


Va £ Vo : v a = v 0 , (34) 

Vb £ V?, & (a -> b) £ E 7 : p a ^ b = p b , q a ^,b = qb, (35) 

where Vo C V is the set of K substations nodes (colored in red 
in Pig. Ill, and Vj J is the set of leaf-nodes in tree ‘T. Based on 
the assumption that voltage drop across any line segment is 
sufficiently small, we linearize Eq. (33) by substituting v a = 
1 +£ a , with |e a | <C 1 and have 

V (ab) £ £ . Z a Z b ~ ( r abPa-bb T x ab q a —^b) ■ (36) 
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Notice that the LinDistFlow Eqs. ( |32|36| ), derived assum¬ 
ing that the grid graph is a tree, are exactly equivalent to 
Eqs. ( 25|26|) of the Linear Coupled (LC) Approximation. 
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